Towards speaker independent continuous speechreading
نویسنده
چکیده
This paper describes recent speechreading experiments for a speaker independent continuous digit recognition task. Visual feature extraction is performed by a lip tracker which recovers information about the lip shape and information about the greylevel intensity around the mouth. These features are used to train visual word models using continuous density HMMs. Results show that the method generalises well to new speakers and that the recognition rate is highly variable across digits as expected due to the high visual confusability of certain words.
منابع مشابه
Linear discriminant analysis for speechreading
This paper investigates the use of Fisher-Rao linear discriminant analysis (LDA) as a means of visual feature extraction for hidden Markov model based automatic speechreading. For every video frame, a three-dimensional region of interest containing the speaker's mouth over a sequence of adjacent frames is lexicographically arranged into a data vector. Such vectors are then projected onto the sp...
متن کاملLipreading by Neural Networks: Visual Preprocessing, Learning, and Sensory Integration
Stanford University Stanford, CA 94305 We have developed visual preprocessing algorithms for extracting phonologically relevant features from the grayscale video image of a speaker, to provide speaker-independent inputs for an automatic lipreading ("speechreading") system. Visual features such as mouth open/closed, tongue visible/not-visible, teeth visible/notvisible, and several shape descript...
متن کاملTactiling: a usable support system for speechreading?
The purpose of this study was to find out whether deafened adults can take advantage of the extra information in speechreading given by the vibrational and motional patterns picked up by placing a hand on a speaker's throat and shoulder, and how valuable this tactile supplement is as a support system for speechreading. We have named this method--speechreading with tactile supplement--tactiling....
متن کاملSpeechreading Using Probabilistic Models Speechreading Using Probabilistic Models
A robust method for locating and tracking lips in gray level image sequences is described The method learns patterns of shape variability from a training set which constrains the model during image search to only deform in ways similar to the training examples Image search is guided by a learned gray level model which is used to describe the large appearance variability of lips Such variability...
متن کاملSpeechreading using shape and intensity information
We describe a speechreading system that uses both, shape information from the lip contours and intensity information from the mouth area. Shape information is obtained by tracking and parameterising the inner and outer lip boundary in an image sequence. Intensity information is extracted from a grey level model, based on principal component analysis. In comparison to other approaches, the inten...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997